3574 results found.
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Arabic Chinese English
Availability:
From Data Center(s)
License:
LDC
Size:
303833 words Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Towards Few-Shot Event Mention Retrieval: An Evaluation Framework and A Siamese Network Approach
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bonan Min | ACE (Automatic Content Extraction) 2005 Corpus | /N |
Documentation:
Yes. English. Yes.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
CreativeCommons
Size:
6639 sentences Production Status:
Newly created-finished
Use:
Text Mining
-
Paper title:SC-CoMIcs: A Superconductivity Corpus for Materials Informatics
-
Paper track:Terminology/poster presentation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kyosuke Yamaguchi | SC-CoMIcs | /N |
Documentation:
An English annotation guideline is under preparation
Written
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Haiyue Song | Asian Scientific Paper Excerpt Corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:On the Correlation of Word Embedding Evaluation Metrics
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | François Torregrossa | MSR | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Haiyue Song | TED | /N |
Documentation:
None
Tokenizer,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
-
Paper title:Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Haiyue Song | NLTK | /N |
Documentation:
None
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
-
Paper title:Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Haiyue Song | NLPL word embeddings repository | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Newly created-finished
Use:
Dialogue
-
Paper title:The Margarita Dialogue Corpus: A Data Set for Time-Offset Interactions and Unstructured Dialogue Systems
-
Paper track:Multimodality/oral presentation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Nizar Habash | Margarita Dialogue Corpus | /N |
Documentation:
Yes
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
Size:
499 entries Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Using Deep Neural Networks with Intra- and Inter-Sentence Context to Classify Suicidal Behaviour
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Xingyi Song | CRIS Suicidal Behaviour Corpus | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Multilingual
Languages:
Dutch English French
Availability:
From Owner
License:
Size:
51220 entries Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Identifying Cognates in English-Dutch and French-Dutch by means of Orthographic Information and Cross-lingual Word Embeddings
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Els Lefever | Gold Standard for Cognate Pairs in English-Dutch and French-Dutch | /N |
Documentation:
Labat, S., Vandevoorde, L., and Lefever, E. (2019). Annotation Guidelines for Labeling English-Dutch Cognate Pairs, version 1.0. Technical report, Ghent University, LT3 15-01




